Online and Incremental Mining of Separately-Grouped Web Access Logs

نویسندگان

  • Yew Kwong Woon
  • Wee Keong Ng
  • Ee-Peng Lim
چکیده

The rising popularity of electronic commerce makes data mining an indispensable technology for business competitiveness. The World Wide Web provides abundant raw data in the form of web access logs, web transaction logs and web user profiles. Without data mining tools, it is impossible to make any sense of such massive data. In this paper, we focus on web usage mining because it deals most appropriately with understanding user behavioral patterns which is the key to successful customer relationship management. Previous work deals separately on specific issues of web usage mining and make assumptions without taking a holistic view and thus, have limited practical applicability. We formulate a novel and more holistic version of web usage mining termed TRAnsactionized LOgfile Mining (TRALOM) to effectively and correctly identify transactions as well as to mine useful knowledge from web access logs. We also introduce a new data structure, called the WebTrie, to efficiently hold useful preprocessed data so that TRALOM can be done in an online and incremental fashion. Experiments conducted on real web server logs verify the usefulness and practicality of our proposed techniques.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Effective web log mining and online navigational pattern prediction

The web has become the world's largest repository of knowledge. Web usage mining is the process of discovering knowledge from the interactions generated by the user in the form of access logs, cookies, and user sessions data. Web Mining consists of three different categories, namely Web Content Mining, Web Structure Mining, and Web Usage Mining (is the process of discovering knowledge from the ...

متن کامل

Web-Page Recommendation Using an Enhanced Incremental Sequence Mining Algorithm Along With Ontology

Web page recommendation is a process to recommend appropriate web pages to the user according to the user interest.When user is on a webpage they should get a proper recommendation so that they gain relevant results. Appropriate knowledge discovery from Web usage data and correct representation of that knowledge for successful Web -page recommendation is important. The paper presents a techniqu...

متن کامل

Data Mining Techniques to Discover Students Visiting Patterns in E-learning Resources

In recent times, the rapid progress of internet technology has triggered the extensive development of web-based learning environments in the educational world. Online learning resources provide various types of online learning assets like tutorials, e-books, scientific articles, etc. Nowadays students prefer Elearning resources for learning and collecting useful information through it. As stude...

متن کامل

تشخیص ناهنجاری روی وب از طریق ایجاد پروفایل کاربرد دسترسی

Due to increasing in cyber-attacks, the need for web servers attack detection technique has drawn attentions today. Unfortunately, many available security solutions are inefficient in identifying web-based attacks. The main aim of this study is to detect abnormal web navigations based on web usage profiles. In this paper, comparing scrolling behavior of a normal user with an attacker, and simu...

متن کامل

Mining Access Patterns Eeciently from Web Logs ?

With the explosive growth of data available on the World Wide Web, discovery and analysis of useful information from the World Wide Web becomes a practical necessity. Web access pattern, which is the sequence of accesses pursued by users frequently, is a kind of interesting and useful knowledge in practice. In this paper, we study the problem of mining access patterns from Web logs e ciently. A...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002